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Description 



Implementing Complex Clock Designs 
in Field Programmable Devices 

Cross Reference to Related Applications 

[0001] The present application is related to and claims priority 
from co-pending US provisional patent application enti- 
tled, "A Method for Easy FPGA Implementation of Designs 
with Complex Clockings", Filed: 4/25/03, Serial Number: 
60/465,928, Attorney Docket Number: TI-36308PS, nam- 
ing as inventors: NATARAJAN et al, and is incorporated in 
its entirety herewith into the present application. 
Background of Invention 

[0002] p ie id 0 f t fr e i nyen ti 0 n 

[0003] The present invention relates to field programmable de- 
vices, and more specifically to a method and apparatus for 
implementing complex clock designs in such devices. 

[0004] Related Art 

[0005] Field programmable devices (FPD) generally refer to pre- 



fabricated logic circuits which can be programmed to im- 
plement a circuit logic. A typical FPD contains many cells, 
which can individually be programmed to one of several 
pre-specified logic blocks (e.g., a logic gate or a sequen- 
tial element) and can be interconnected in a desired fash- 
ion to implement a desired circuit logic. Examples of FPDs 
include FPGAs (field programmable gate arrays) and pro- 
grammable logic devices (PLDs) as is well known in the 
relevant arts. 

[0006] FPDs find application in several areas. For example, an 

entire circuit logic can be implemented using FPDs quickly 
without having to engage in expensive and time- 
consuming tasks such as implementing masks for fabrica- 
tion of individual integrated circuits. As another example, 
FPDs are used for prototyping circuits to ensure at least 
some aspects of the proposed circuit logic can be verified. 

[0007] FPDs generally need to support implementation of com- 
plex clocks since such clocks would be required in at least 
some circuit logics. For example, some or all of derived 
clocks, divided clocks, gated clocks, independently gener- 
ated clocks, etc., may be generated and/or used in differ- 
ent parts of a circuit logic. 

[0008] one typical requirement in having such complex clocks is 



to ensure that the time delay ('skew') between two clock 
signals is within a pre-specified value. If the skew is 
higher than the pre-specified value, various anomalies 
such as unpredictable results may be caused, as is well 
known in the relevant arts. 

[0009] | n one pr j or approach, an FPD (while being manufactured) 
may be designed to provide a small number of clock 
buffers which provided limited skew, thereby addressing 
the problem noted above. However, one problem with 
such an approach is that a circuit logic may contain sev- 
eral more (number of) clock signals, and accordingly the 
corresponding solutions may be inadequate. 

[0010] in view of problems such as above, a designer may spend 
a substantial amount of time addressing the clock related 
problems, and accordingly such solutions are not accept- 
able at least in some environments (e.g., when rapid pro- 
totyping is desirable). What is therefore needed is a 
method and apparatus to implement complex clock de- 
signs in FPDs. 
Brief Description of Drawings 

[0011] various features of the present invention will be described 

with reference to the following accompanying drawings. 
[0012] Figure (Fig.)l is a circuit diagram shown containing a por- 



tion of a FPGA illustrating the problem caused due to 
skew associated with circuit clocks in one prior embodi- 
ment. 

[0013] Figure 2 is a graph depicting the timing diagram illustrat- 
ing the problem associated with the circuit diagram of 
Figure 1. 

[0014] Figure 3 is a circuit diagram of a FPGA implemented ac- 
cording to an aspect of the present invention. 

[0015] Figure 4 is a circuit diagram of a modified sequential ele- 
ment implemented according to an aspect of the present 
invention. 

[0016] Figure 5 is a timing diagram illustrating the operation of a 
FPGA implemented according to an aspect of the present 
invention. 

[0017] | n t he drawings, like reference numbers generally indicate 
identical, functionally similar, and/or structurally similar 
elements. The drawing in which an element first appears 
is indicated by the leftmost digit(s) in the corresponding 
reference number. 
Detailed Description 

[0018] l. Overview 

[0019] According to an aspect of the present invention, a global 



clock signal is used to control the time point of transition 
of sequential elements in a FPD (field programmable de- 
vice) to a next state. The global clock signal may be con- 
nected to the clock inputs of the respective sequential el- 
ements. Circuit clocks, which are otherwise intended to 
control the state transitions (of the sequential elements), 
are used as enable signals for the sequential elements. 

[0020] Due to the use of the circuit clocks as enable signals, the 
sequential elements may undergo the same transitions as 
compared to an implementation in which the circuit clocks 
are connected to the respective clock inputs of the se- 
quential elements. As the timing of the transitions is con- 
trolled by the global clock, any substantial skew between 
the circuit clocks may not affect the functional operation 
of the base sequential elements, thereby overcoming at 
least some of the problems noted above in the back- 
ground section. 

[0021] Thus, a designer may implement complex circuit logic 
without having an in-depth understanding of the clock- 
related issues. Such features may also be of interest at 
least in prototyping situations in which the functional op- 
eration of a circuit logic is sought to be verified since a 
designer may be substantially relieved of several clock 



timing related issues. Thus, the time required for proto- 
typing may be reduced. 

[0022] Accordingly, by using various features of the present in- 
vention, FPDs may be programmed with a circuit logic, 
without being affected by potential skew between various 
circuit clock signals. It is helpful to first understand the 
problem associated with a prior implementation of FPDs, 
and accordingly the description is continued with refer- 
ence to a prior approach which illustrates the problem 
caused due to skew associated with circuit clocks. 

[0023] 2. Example Prior Approach 

[0024] Figure 1 is a circuit diagram shown containing a portion of 
FPGA 100 illustrating the problem caused due to skew as- 
sociated with circuit clocks in one prior embodiment. For 
simplicity of understanding, FPGA 100 is shown contain- 
ing only few sequential elements (e.g., D type flip-flops), 
however, FPGA 100 may contain many other components 
(sequential elements and combinatorial logic) connected 
according to a circuit logic. FPGA 100 is shown containing 
base sequential elements 110, 120, 150 and 160, and de- 
lay block 130. Each block is described in detail below. 

[0025] Base sequential elements 110, 120, 150 and 160 imple- 
ment a circuit logic when clocked according to the re- 



spective circuit clocks 111, 112, 115 and 116 respec- 
tively. Assuming that each base sequential element forms 
a D flip-flop, the data on paths 102, 103, 105, and 106 is 
provided on corresponding output paths 115, 125, 155 
and 165 according to the corresponding circuit clock. It 
should be understood that the circuit clocks can be gen- 
erated independently or from a common system clock. 

[0026] Delay block 130 delays clock signal 113, and the delayed 
clock signal is provided as circuit clock 115 to base se- 
quential element 150. Delay block 130 may contain a 
combinatorial logic to introduce the delay. The delay is 
assumed to cause a substantial skew of circuit clock 115 
in relation to circuit clock 116, and the resulting undesir- 
able results are described below with respect to Figure 2. 

[0027] Figure 2 is a timing diagram illustrating the manner in 
which unpredictable results may be caused due to skew 
associated with circuit clocks. Waveforms 216, 215, 206, 
265 and 255 respectively represent circuit clock 116, cir- 
cuit clock 115, input on path 106, output on path 165 and 
output on path 155. 

[0028] At time point 2071 1 In both this figure and Figure 5, it 

appears that 207 needs to be earlier than time point 210. 
In Figure 5 you had proposed to change it (possibly under 



the assumption that this time point is for the other input), 
the data on path 106 is changed from logic high to logic 
low (0). Before time point 210, data on paths 255 and 265 
is shown at logic high (1). At time point 210, circuit clock 
216 is shown going from 0 to 1, and circuit clock 215 is 
shown following to 1 at time point 220-1 with a skew of 
duration 250. Skew 250 between circuit clocks 116 and 
115 is assumed to be caused by delay block 130. Ideally 
skew 250 should equal zero. 
[0029] Assuming that skew 250 is short, dotted portion 260 rep- 
resents the correct (expected) output on path 155, which 
represents a scenario in which the data (1) on path 165 in 
the previous clock cycle is propagated as the output on 
path 155. 

[0030] on other hand, if skew 250 is long, the 0 value of wave- 
form 206 is propagated to path 165 (waveform 265) after 
time point 210 (the rising edge of circuit clock 116), and 
the propagated data is further propagated to path 155 
(waveform 255) after time point 220-1 (the rising edge of 
circuit clock 115) assuming that the time point 210 is 
sooner (compared to time point 220-1) by at least the 
setup time of base sequential element 150. 

[0031] As may be readily observed, such a result is undesirable. 



Various aspects of the present invention overcome such a 
disadvantage even in the presence of complex clocks as 
described below in further detail. 

[° 032 ] 3. Support for Complex Clocking 

[0033] Figure 3 is a circuit diagram illustrating the details of 
FPGA 300 implemented according to an aspect of the 
present invention. FPGA 300 is shown containing modified 
sequential elements 310, 320, 360 and 370, and global 
clock received on path 350. The components of Figure 3 
are described in relation to the corresponding compo- 
nents of Figure 1 for conciseness. 

[0034] Each modified sequential element (310, 320, 360, 370) is 
shown receiving three inputs, with global clock 350 being 
connected to the clock input of each base sequential ele- 
ment. Each circuit clock (111, 112, 115, and 116) is 
shown connected to the enable input of the corresponding 
base sequential element. 

[0035] Each modified sequential element transitions to a next 
state only after a (e.g., rising) transition of the corre- 
sponding enable input (i.e., the circuit clock). However, 
the specific time point of transition is controlled by a 
transition of global clock 350 (due to connection to the 
clock input). Global clock 350 may be designed to be a 



higher speed clock (compared to the circuit clocks), 
thereby ensuring that the transitions occur soon after the 
transitions on the circuit clocks. 

[0036] D ue to such an implementation, some of the problems 
due to the skew between various circuit clocks is elimi- 
nated/reduced, as described in further detail in sections 
below. The description is continued with reference to the 
details of implementation of an example embodiment of 
modified sequential element 370. 

[0037] 4 Implementation of Modified Sequential Element 

[0038] Figure 4 is a block diagram illustrating the details of a 

modified sequential element implemented according to an 
aspect of the present invention. For illustration, only se- 
quential element 370 is described below, however, other 
sequential elements (310, 320 and 360) may be imple- 
mented similarly. Modified sequential element 370 is 
shown containing edge detect block 410, multiplexor 430, 
and base sequential element 450. Each block is described 
in detail below. 

[0039] Edge detect block 410 generates an enable pulse (on path 
413) for one clock cycle of global clock 350 on receiving a 
rising edge of circuit clock 115. The enable pulse is pro- 
vided as select control signal 413 to multiplexor 430. The 



implementation of edge detect block 410 will be apparent 
to one skilled in the relevant arts by reading the disclo- 
sure provided herein. Base sequential element 150 may 
operate similar to base sequential elements 110/120/150 
and 160, and thus store a value received on path 405 at 
an (rising) edge of global clock 350. 

[0040] Multiplexor 430 selects one of the inputs received on 
paths 105 and 453 as output on path 405 according to 
the logic level received on select control signal 413. Thus, 
the data on path 105 is selected when (in the clock cycle) 
a pulse (generated by edge detect block 410) is received 
on select control signal 413, and the data on path 453 is 
selected otherwise. As global clock 350 operates at a high 
frequency, the output available on path 155 is fed back as 
input to base sequential element 450. 

[0041] a s a result, the data available on path 105 is transferred 

only after the rising edge of circuit clock 105, but the time 
of transfer is controlled byclock 350. The description is 
continued with reference to a timing diagram illustrating 
the operation of the circuits of Figures 3 and 4 in further 
detail. 

[0042] 5 Timing Diagram 

[0043] Figure 5 is a timing diagram illustrating the manner in 



which the problem(s) associated with Figures 1 and 2 may 
be addressed by the circuits of Figure 3 and 4. For con- 
ciseness, only the differences of Figure 5 as compared to 
Figure 2 are described for conciseness. In addition to the 
signals of Figure 2, the timing diagram of Figure 5 is 
shown depicting clock 550 (corresponding to global clock 
350), select control signal 513 (path 413), output565 
(path 165 of Figure 4), and output 555 (path 155 of Figure 
4). Each waveform is described in detail below. 

[0044] Select control signal 513 is shown rising from logic low to 
logic high soon after receiving the rising edge of global 
clock 350 after receiving active edge (at time point 220-1) 
of circuit clock 115. As shown, circuit clock 115 is shown 
going high at time point 220-1, and select control 513 is 
shown rising after time point 550-0 thereafter. The en- 
able signal stays at logic high for one clock duration of 
global clock 350, as shown. 

[0045] unlike in Figure 2, due to the use of the modified sequen- 
tial elements, the transfer of data is postponed to time 
point 550-1 (the rising edge of global clock 350). As both 
the modified sequential elements 360 and 370 transfer 
the corresponding data elements at substantially the same 
time, the problems of Figure 2 are avoided. 



[0046] ^ should be understood that the approaches of above can 
be integrated into several environments, while taking into 
consideration various considerations. Some example con- 
siderations are described below. 

[0047] q Implementation Considerations 

[0048] As may be observed, the frequency of global clock 350 is 
higher than the frequency of other circuit clocks (115 and 
116). The frequency of global clock 350 may be chosen 
taking into account the maximum permissible skew 
among the different circuit clocks. In general, the clock 
duration of global clock 350 needs to be more than the 
maximum skew between all circuit clocks. 

[0049] Even though modified sequential element of Figure 3 is 
shown containing an edge detect block (which typically 
contains an additional sequential element), it should be 
understood that the edge detector circuit may be shared 
by many modified sequential elements. In general, the 
same edge detector circuit may be shared by all the se- 
quential elements in the same cluster domain (which need 
to receive the clock signal at the same time). As a result, 
various aspects of the present invention can be imple- 
mented without substantially more number of sequential 
elements. 



[0050] The global clock may be provided on a low skew path 

such that the transitions are available to all modified se- 
quential elements at substantially the same time. How- 
ever, due to the use of various features of the present in- 
vention, the circuit clocks may be provided on high skew 
paths. As a result, a circuit logic may be implemented on 
a FPD that support only a limited number of (or even one 
single) low skew clock networks. 

[0051] | n addition, it may be appreciated that the circuit clocks 
may need to be either derived from or synchronized with 
the global clock signal. 

[0052] z Conclusion 

[0053] while various embodiments of the present invention have 
been described above, it should be understood that they 
have been presented byway of example only, and not 
limitation. Thus, the breadth and scope of the present in- 
vention should not be limited by any of the above de- 
scribed exemplary embodiments, but should be defined 
only in accordance with the following claims and their 
equivalents. 



